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Abstract 

A common problem in particle physics is the require¬ 
ment to reproduce comparisons between data and theory 
when the theory is a (general purpose) Monte Carlo simu¬ 
lation and the data are measurements of final state observ¬ 
ables in high energy collisions. The complexity of the ex¬ 
periments, the obervables and the models all contribute to 
making this a highly non-trivial task. 

We describe an existing library of Fortran routines, HZ- 
Tool, which enables, for each measurement of interest, 
a comparable prediction to be produced from any given 
Monte Carlo generator. The HZTool library is being main¬ 
tained by CEDAR, with subroutines for various measure¬ 
ments contributed by a number of authors within and out¬ 
side the CEDAR collaboration. 

We also describe the outline design and current sta¬ 
tus of a replacement for HZTool, to be called Rivet (Ro¬ 
bust Independent Validation of Experiment and Theory). 
This will use an object-oriented design, implemented in 
C++, together with standard interfaces (such as HepMC 
and AIDA) to make the new framework more flexible and 
extensible than the Fortran HZTool. 

INTRODUCTION 

While the Standard Model provides methods of mak¬ 
ing some calculations to an impressive degree of precision, 
there are still many areas where first-principles calculations 
must be supplemented by a more phenomenological ap¬ 
proach. This is particularly the case in collisions involving 
hadrons, where the non-perturbative regime of QCD plays 
a major role, especially in the hadronisation stage, as do 
parton density functions, fragmentation and - especially at 
high energies - the underlying event, i.e. the part of the 
hadronic final state that is not coherent with the hard scat¬ 
ter. These effects are illustrated in figure [T] 

Calculations using only the well understood “hard” 
regime of QCD can provide only an incomplete test of the 
Standard Model and of theories that go beyond it. Mod¬ 
els that include these more complicated effects and provide 
a full description of the final state, generally implemented 
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Figure 1: A hadronic collision. 


as Monte Carlo event generators, fill a number of roles in 
high-energy physics. As well as providing predictions that 
can be used to test the agreement between theory and exper¬ 
iment, they are extensively used to correct for detector ef¬ 
fects in making measurements, to estimate the background 
to processes of interest, and to make predictions of both 
standard and exotic processes for use in planning analyses 
and designing future accelerators and detectors. 

The models used to describe these processes typically 
have a large number of free or weakly constrained parame¬ 
ters, which must be tuned to the data. This is a complicated 
task, as the same parameter may affect the predictions for 
different measurements in widely differing way. It is rel¬ 
atively easy to tune a parameter or limited set of parame¬ 
ters in order to provide a good description of a particular 
observable, only to spoil the agreement of the model with 
other data. It is therefore important to compare models si¬ 
multaneously to a wide range of data from different col¬ 
liding particles in different energy regimes, including older 
data from experiments that may no longer be running, but 
still constrain the range of acceptable models. 

It is to provide a solution to this problem that HZTool (T) 
has been developed, and Rivet |2j is now being created. 
These packages form an essential part of the wider pro¬ 
gramme of work being carried out by the CEDAR |[3j col¬ 
laboration. 

REQUIREMENTS 

In order to enable meaningful comparisons between data 
and Monte Carlo predictions, both HZTool and Rivet must 











incorporate routines or modules that encapsulate informa¬ 
tion about the meaning of the measured quantities that goes 
beyond the simple numerical values stored in the HEP- 
DATA El database. These routines must make the data 
available for future studies and model comparisons that 
may not have been possible when the data was published. 
The precise meaning of an observable and the procedure 
used to produce it may be hard to reconstruct from the pub¬ 
lished paper. It is therefore useful for the routine to be writ¬ 
ten, if possible, by the authors of the original analysis. 

The framework must cleanly separate the analysis rou¬ 
tine from any individual event generator, reading the event 
information only via a common interface. This means 
that the routine can be used unchanged with any genera¬ 
tor, rather than being restricted to those in use at the time 
the measurement was made and published. 

HZTool and Rivet provide essential low-level tools en¬ 
abling the comparison of theoretical predictions with ex¬ 
perimental data. They may be used as stand-alone pack¬ 
ages, but there are also a number of ways in which their 
functionality can be extended. JetWeb 15] uses HZTool as 
a back end (and will in future use Rivet) but adds a web 
interface and a database of predictions, and is described in 
another contribution to these proceedings 0. This reduces 
duplication of effort by enabling predictions generated by 
one user to be reused by others, but still requires the user to 
specify the model of interest including any parameter val¬ 
ues that differ from the defaults. There is clearly scope for 
higher-level tools that automatically vary parameters in or¬ 
der to find improved models. Professor 0 is an example 
of an attempt to do this. 

HZTOOL 

The HZTool framework, written in Fortran 77, was orig¬ 
inally developed as part of the workshop on Future Physics 
at HERA in 1997. It started with an emphasis on HERA 
results, but subsequently it has been expanded to cover a 
wider range of measurements. HZTool has recently be¬ 
come part of the CEDAR project, and some further devel¬ 
opment has taken place in order to improve its modularity 
and to bring together some of the different versions that 
were in use into a single easily available package. Devel¬ 
opment has been moved to the HepForge 0 environment. 

Each included experimental analysis has a correspond¬ 
ing Fortran subroutine in HZTool. This subroutine takes 
events from a Monte Carlo event generation run and fills 
histograms in order to produce data and plots that can be 
compared directly with the experimental results. In the 
most recent version, HZTool is purely a library of analysis 
routines and associated utilities such as jet finders, which 
may be used by a number of different routines. The main 
programs and steering routines that were formerly part of 
the package have been split off into a separate package, 
HZSteer, which is designed chiefly as an interface between 
JetWeb and HZTool, although it can also be used to run 
HZTool independently of JetWeb. This separation means 


that HZTool is completely independent of any Monte Carlo 
generators, with all such dependencies existing only in 
HZSteer. 

The HEPEVT common block is used in order to make 
the code independent of the particular generator used. His- 
togramming is done using the HBOOK histogramming 
package that forms part of the CERNLIB 0 library. 

As part of the CEDAR project, HZTool is being adapted 
to provide model descriptions in HepML El El format and 
to use HEPDATA as a source of experimental data rather 
than duplicating the information in Fortran DATA state¬ 
ments. 

Figure |2] shows some histograms created using HZTool 
as part of a study of Monte Carlo tuning for the Interna¬ 
tional Linear Collider IIII1121 . The continuous black line 
shows the predictions of the default HERWIG E3 model 
for dijet production in 77 events in e + e _ collisions at a 
centre-of-mass energy of 500 GeV. The other curves show 
the predictions of models using HERWIG with parame¬ 
ters tuned to a range of jet production data from HERA, 
LEP and the Tevatron. The results show the benefit of sys¬ 
tematic tuning, since the models that have been tuned to 
a wide range of data show reasonable agreement with one 
another, despite differing significantly from the generator 
default settings. 



Figure 2: Predictions of various HERWIG models for jet 
production in e + e _ collisions at a centre-of-mass energy 
of 500 GeV. From fill . 


RIVET 

Rivet uses an object-oriented design, implemented in 
C++, to make the framework easier to maintain than HZ¬ 
Tool, as well as making it easier to interact with the more 
recent C++ Monte Carlo programs. 

Wherever possible, interaction between Rivet and other 







software and data will use common interfaces rather than 
introducing unnecesary dependences on particular pack¬ 
ages. This includes the use of HepMC fl4l for accessing 
information about the generated events. It is important that 
when a new generator (or version) is released, its predic¬ 
tions can be compared to all data in the Rivet library with¬ 
out the need to modify each analysis routine. The use of 
HEPDATA for access to data and the HepML interface for 
input and output of Monte Carlo configurations is being 
incorporated in Rivet from the start. The creation of his¬ 
tograms will be dealt with using the AIDA EQ interface. 

Rather than reimplement jet finders and other general 
analysis utilities already available elsewhere. Rivet will 
make use of existing libraries such as KtJet fThl . Where 
suitable general-purpose (i.e. independent of any particular 
experiment’s framework) tools are not available, we will try 
where possible to make our solutions available and usable 
without depending on Rivet. 

As in the case of HZTool and HZSteer, the steering and 
all generator-dependent code is in a separate package, Riv- 
etGun, so that the Rivet library can be compiled without 
any generator code and does not need to be recompiled in 
order to use it with a new generator. 

Each physics paper implemented in Rivet will be rep¬ 
resented by an analysis “module”. The analysis modules 
have access to a collection of Projection classes, each of 
which calculates some property of an event. A projection 
may be built up from a series of simpler projections. For 
example, a projection that finds jets in an event might call 
on another projection to select particles in the event. 

Rivet will run roughly as follows: 

• A run will in general use a single generator with one 
combination of beam particles, energies and steering 
parameters. 

• A RivetHandler will be instantiated, and will be 
given a collection of Analysis objects to use, each 
corresponding to a physics paper with measurements 
to be compared to the generator predictions. 

• Analysis classes will inherit from a common base 
class, AnalysisBase. Each such class will use 
Projection objects to calculate event properties and 
implement kinematic cuts. 

• A Projection takes an Event and constructs some 
property of the event, which may be a single number 
(e.g. Q 2 ) or a more complex object such as a set of 
jets boosted to a particular coordinate frame. 

• Each Projection object is instantiated along with 
the Analysis at the beginning of a run, but only acts 
on the current event if and when it is actually asked for 
the information. Once it has carried out its calculation 
it caches the result, which can then be accessed again 
by the same Analysis or a different Analysis, thus 
saving a significant amount of time in the case of e.g. 
a jet algorithm. 


• Once an Analysis has used a series of Projections 
to calculate the quantities of interest and apply any 
necessary cuts, it uses the Rivet histogramming code 
to fill the relevant histograms. This will probably 
involve simply using the AIDA interfaces to invoke 
whatever histogramming package is chosen by the 
user. 

CONCLUSIONS 

HZTool has been actively used for eight years, both as a 
stand-alone application and as the back-end to the JetWeb 
interface. It will continue to be maintained as long as the 
knowledge encoded in its routines is not available in an¬ 
other form. However, active development will soon cease 
in favour of Rivet, and once Rivet is available collabora¬ 
tions will be encouraged to provide Rivet modules instead 
of HZTool subroutines. 

These frameworks provide a fundamental part of the 
CEDAR programme, making use of other CEDAR projects 
such as HEPDATA, and providing an input to higher-level 
tools such as JetWeb, and potentially to more automated 
tuning systems such as Professor. 

While a clear timescale is set by LHC start-up, and the 
short-term goals of CEDAR are directed towards this, the 
project will not stop there but will continue to provide a 
valuable resource for the Linear Collider and other future 
facillities. 
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